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Abstract 

We define the e-distortion complexity of a set as the shortest pro- 
gram, running on a universal Turing machine, which produces this set 
at the precision e in the sense of Hausdorff distance. Then, we esti- 
mate the e-distortion complexity of various central Cantor sets on the 
line generated by iterated function systems (IFS's). In particular, the 
e-distortion complexity of a Cantor set depends, in general, on k 
and on its box counting dimension, contrarily to Cantor sets generated 
by polynomial IFS or random affinc Cantor sets. 

keywords: central Cantor sets, iterated function system, random Can- 
tor sets, scaling function, Cantor sets, box counting dimension. 

1 Introduction 

Nowadays, computers are being widely used to generate images in the anal- 
ysis and simulations of real-life processes and their mathematical models. A 
natural issue is to measure the complexity of drawing a set of points on a 
computer, which describes a continuous object at a given precision. Particu- 
lar examples of complex objects are fractal sets which arise in many contexts 
[3] . Well-known examples of fractal sets are strange attractors of dissipative 
dynamical systems and Julia sets. Another way to generate fractal sets is 
to use iterated function systems j^. 

The way we measure the complexity of a (compact) set can be collo- 
quially described as follows. We define the e-distortion complexity of a set 
^ as the minimal length of the programs producing a finite set e-close to 
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in the sense of HausdorfF distance. As in the classical notion of Kol- 
mogorov complexity of sequences, by programs we mean programs running 
on a universal Turing machine [H]. We are interested in the behavior of the 
e-distortion complexity when e is getting small, and in eventual relations of 
this behavior with other characteristics of the set {e.g., fractal dimension). 

In the present article we consider various classes of Cantor sets on the 
real line generated by iterated function systems (IFS's) li^ and compute 
bounds from above and below of their e-distortion complexity as a function 
of e. We first consider IFS's with polynomial contractions and obtain the 
upper bound const xlog(e^^) for the e-distortion complexity of the generated 
Cantor set, where the (finite) constant may depend on the polynomials. We 
can produce "many" polynomial IFS's with a lower bound of the same order 
using a probabilistic construction. It turns out that some particular Cantor 
sets like the usual middle third Cantor set are of much lower complexity. 
For analytic IFS's, we obtain the upper bound const x (log(e~^))^. Next 
we consider random central Cantor sets produced by affine IFS's, for which 
the contraction rate is chosen at random at each step of the construction. 
In this case, we get the upper bound const x (log(e~"'^))^ and the lower 
bound const x (log(e~^))^~'', for any > 0, for almost all such Cantor 
sets (where the constant in our bound depends on 5 and tends to when 
(5^0). Finally, we consider IFS's. Contrarily to the previous cases, the 
leading, asymptotic behavior of the e-distortion complexity depends on the 
box counting dimension D of the generated Cantor set. Indeed, we obtain 
the upper bound const x e^"^*^, for any 5 > (where the constant in our 
bound depends on 5 and blows up when (5^0). We then construct "many" 

(random) Cantor sets with a lower bound const x e^"*^^^ (for any 5 > 0), 
by constructing their scaling function |14j . 

The case of sets reduced to one point on the line was investigated in [7] 
where in particular the Hausdorff dimension of the set of reals with given 
asymptotic complexity is computed. For graphs of functions, from the point 
of view of determining the values of a function at given precision, relations 
with e-entropy are obtained in pQ. Another notion of complexity is to ask 
about the smallest execution time of the programs generating a given set 
with e-precision, in the sense of Hausdorff distance [I5j. This was used in 
[6] to show that a class of Julia sets was polynomial time computable. 

This article is organized as follows. In Section[2]we define the e-distortion 
complexity of a compact set and state our results. Section [3] is devoted to 
the proofs. 

2 Definitions and results 

For a compact set ^ C M'^, we define its e-distortion complexity as follows. 
Definition 2.1. The e-distortion complexity of a compact set G at 
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precision e > is defined by 



A(<r,e) = min{£(P) : du{C{P),'tf) < e} , 

where the minimum is taken over all binary programs P G {0, 1}* running 
on a universal Turing machine U, which produce a finite subset C(P) G W^; 
1{P) is the program length; denotes the Hausdorff distance. 

Notice that, because of the compactness of we can use a minimum in 
the above definition, which always leads to a finite number. 

For the reader's convenience, we recall that the Hausdorff distance 
between two closed subsets Fi ,F2 of a metric space with metric d is given 
by (see, e.g., mW 

dji{Fi,F2) = max{ sup d{x2,Fi), sup d{xi,F2)}. 

X2&F2 Xi£Fl 

Remark 2.1. If T is a bi-Lipschitz map, there exist two positive constants 
ci,C2 such that, if^ is a compact set, we have 

A(r(^),cie) < A(<^,e) < A(r(^),C2e). 

We now recall the definition of Cantor sets generated by iterated function 
systems p]. For the sake of simplicity, we restrict ourselves to Cantor sets in 
the unit interval [0, 1] C M, although several results can be easily generalised 
to arbitrary finite dimension. 

Let A = [0, 1] and let T be a finite set of indices with at least two 
elements. An Iterated Function System (IFS for short) is a collection 

{(t)i: A:i ^1} 

of injective contractions on A with uniform contraction rate p € (0, 1), and 
such that (i)i{A) n (l)j{A) = for i / j. 

For any infinite word uo G and for any n € N, let G T" denote the 
prefix of length n given by the first n symbols of w, and let 

</'^n := 0a;„ O Kr.-! ° " " " ° • (1) 

The map vr : X°° — > A defined by a; i— > 7r(a;) := HJ^o (^) continuous 
(in product topology) and, since 

diam {(l)^r.{A)) < diam (A), 
7r(u;) is a point in A for all u G X°°. The set 

oo 

weX°° n=0 
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is a Cantor set and satisfies 

We are interested in the behaviour of A(^, e) when e tends to zero. Note 
that this is a monotone decreasing function. 

Notation. In the sequel we write f g if there are two positive constants 
Ci and C2 such that for any e > small enough 

Cif{e)<g{e)<C2f{e). 

We write f ^ g if there is a positive constant C such that for any e > 
small enough 

f{e) < Cg{e). 



Our first result deals with polynomial IFS's. 

Theorem 2.1. Let he a Cantor set generated by an IFS with polynomial 
functions. Then 

A(<r,e) =^log(e-i). (3) 

Moreover, for any 5 > 0, there exist (many) polynomial IFS's such that the 
generated Cantor set satisfies 

(l-5)log(e-i) < ACr,e). (4) 

Remark 2.2. A more precise upper bound follows easily from the proof, 
namely 

A(^, £) < ^{l + deg 0,) j log{e-') + o(log(e-i)). 

A more precise lower bound of the same kind can also be obtained for a large 
class of Cantor sets which are generated by a set of full measure of some 
random polynomial IFS's (see below for the definition of a random IFS). 

Remark 2.3. The classical examples of Cantor sets are the middle ^-th 
Cantor sets in the unit interval (p = ^ gives the usual middle third Cantor 
set). They can be thought of as generated by IFS with affine contractions (t)Q 
and 4>i, and contraction rate p. For these Cantor sets, all we need for their 
construction is the knowledge of e and of p. We can choose e to be a number 
of low complexity. In the particular case where p is rational, A('^, e) grows 
slower than any unbounded partial recursive function. 

^We thank Arnaldo Mandel for this observation. 



4 



Remark 2.4. Notice that there are examples of Cantor sets with low e- 
distortion complexity which contain numbers of high complexity. This hap- 
pens for example in the middle third Cantor set. 

Our next result is about real analytic IFS's. 

Theorem 2.2. Let ^ be a Cantor set generated by an IFS with real analytic 
functions. Then 



We now turn to random affine IFS's, for which at each step of the con- 
struction we consider a random choice for the contraction rate. For the 
definition we follow [1]. 

Let us consider a family {Xk)k<^N of independent identically distributed 
random variables with values in the interval (0, 1). To each sequence A = 
(Afc)fceN we associate a Cantor set in the following way. Let := [0, 1]. 
We define 



jliX) :- 



0, 



Ai 



1 



Ai 



1 



and Cl := Ji^(A) U J2^(A). 



In words, C| is obtained by removing the central interval of length (1 — Ai) 
from C^. At the {k + l)-st step, we delete from each interval jf{X), i = 
1, . . . ,2^^, the central interval of length (1 — Afc+i), obtaining 2^~^^ intervals 



jf+i(A),z = L 



,2'=+^ such that 
, fc+i 



= 1, , 



(5) 



h=l 



Then we define 



2fc+l 

U -^^'w- 



i=l 



We call random central Cantor set the set 

oo 

:= n C\ 

k=0 



We remark that by construction the boundary points dJf{\) of all the in- 
tervals (A) are contained in '^\. 

The next theorem states that random central Cantor sets need more 
information than those generated by polynomial IFS's. 

Theorem 2.3. Let ^\ he a random central Cantor set as described above. 
Then, for any A G (0, 1)^, 



(6) 
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Moreover, let us assume that the common distribution of the i.i.d. random 
variables (A^) is absolutely continuous, with a density f{x) bounded above 
and below away from zero. Then, for any 5 > 0, we have 

(log(e-i))'-% A(<^;„e) (7) 

for almost every A G (0, 1)^. 

We now consider Cantor sets with a differ entiable structure. Following 
[H], [E] and [5], this corresponds to Cantor sets generated by IFS's and 
we call them Cantor sets. We shall recall their contraction in Subsection 

Eai 

Theorem 2.4. Let k > \. For any 5 > 0, for any Cantor set with 
box counting dimension D, we have 

^{^,e)^e-T-^. (8) 

Moreover, for any 6 > ^, there exist (many) central Cantor sets ^ with 
box counting dimension, at most D + 5, such that 

e-T+' ^A{'^,e). (9) 

We emphasise that in this case the asymptotic behaviour of the e- 
distortion complexity, when e tends to zero, depends in general on the reg- 
ularity k of the set and, contrarily to the previous cases, it also depends on 
its box counting dimension D. 

Remark 2.5. We notice that our proofs also provide estimates for the Kol- 
mogorov's e-entropy of some families of Cantor sets fWjj in the Hausdorff 
distance. 

3 Proofs 

The following two simple lemmas will be used repeatedly in the proofs here- 
after. We leave their elementary proof to the reader. 

Lemma 3.1. Let F and F' be closed subsets of A. Let I = [a, 6] and 

I' = [a\b'] be closed sub-intervals of A. Let H = [c,d] and H' = [c',d'] be 

o ° 

closed subsets of I and I' respectively. Assume that dH C F , dH' C F' , 

o ° 

Fr\ H= and F'D H'= 0. Moreover assume that there exists e > such that 
\a—a'\ < e, \b—b'\ < e, \c—d\ > 2e, \c'—d'\ > 2e anc? max {|c — c'|, \d — d'\} > 
e, then d^{F,F') > e. 

In the sequel, this lemma will be used to show that two Cantor sets {F 
and F') are a Hausdorff distance larger than e, H and H' playing the role 
of holes in the Cantor sets. 
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Lemma 3.2. Let (fi,^, P) be a probability space. Let ^ be a measurable 
map from {Vl^A) to the set of closed subsets of A equipped with the Borel a- 
algebra induced by the H aus dor jf metric. Let {ak)k be a positive, increasing, 
diverging sequence. Assume that for any integer k there exists a sequence 
{Vkj)i<j<2'^k of measurable subsets of Q such that 



{a;: A(^(a;),2-'=) <afc} C U Vfc,, 



and 



2"fc 



k j=i 

Then, for F-almost every uj, A(^(u;),2~^) > for any k large enough 
(depending on to). 



3.1 Proof of Theorem D 

Let be the largest degree of the polynomial functions {(/>«}, then we can 
write 



0<a<N 



with coefficients Ci^a £ 



We now show how to construct a program P 
approximating 'rf within Hausdorff distance e. 

Let e be fixed and K a constant to be specified later on. Let us define 
e' = We construct polynomials 



(j)i{x) := ^ a^aX" 

0<a<N 



y iGl 



with coefficients satisfying 



y i£l yo<a<N 



(10) 



such that the (pi^s are injective contractions on A with uniform contraction 
rate p £ (0, 1). For any G T" we construct the composition (p^„ as in ([T|). 

We first show that for any bounded set B cM such that 4'i{B) C B for 
all i £ I with the same contraction rate p, and (j)i{B) C B for all i £ I, we 
have for all n S N 



dM,.iB),4^^„iB))<e 



,l-p" 



sup 

P xeB 



E 

0<a<N 



V e J". 



:ii) 
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The proof is by induction. By (fTOj) and definition of d^, one immediately 
gets 



d^{4>i{B),4)i{B)) < e' sup 



0<a<Af 



Tlie inductive step is obtained by using the triangle inequality for dn- By 
the first step we have 



dn ( <i)u^M u-i{B)),(l)uA<P u-i{B)) ] < e' sup 



E 

0<Q<Af 



where we have used (j) (B) C B. Moreover, by using the contraction rate 
p, we get 

dn(<P.M .-AB))A.M „.AB))] <pd^{ct> ^_AB)A 



0<a<A' 



< e p — sup 

1 — P xes 



where the last inequality is the (n — l)-th step of the induction. Hence the 
triangle inequality implies that 



1 — jO"~^ 

d^i^^„ (B), (B)) <e' [l + p I sup 

J- — P / xe-B 



0<Q!<Af 



X 



This finishes the proof of (jlip . 

Let us choose n G N such that < e' and < e' . For this fixed n, let 
1/ := {0, 1} = a[0, 1] and define 

We now prove that d^{'i§', C) < e. Let us consider x S and y £ C. By ([2]) 
there exists z £ ^ such that x = 4'^^^^-^ i^) ^ given sequence ui^{x) G T". 
Hence 

d{x,y) = d{<j)^^^^^iz),y) < 
'^h(</'..,., (z), 0..,,, (V^)) + dH(0^.,,, {V), l^^^^ {V)) + dH(<^^^^^, {V),y). 



(12) 



For the first term we use the contraction properties to get 



du{<P^^^Jz),ct>^^^jV)) < p^' diam (A) < e'. 
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By (llip . for the second term we have 
If we take 



then 

d{x, y) < e'K + (iH(0^.(,, {V),y)=e + d^{4>^^^^^ {V),y). 
Choosing y ^ 6 - , iV) C C, we have 

Ml.jV),y) = 0, 
hence d{x, y) < e. Therefore 

sup d(x, C) < e. 

On the other hand for a given cj" G X" and y G (j)^^ {V), take j; G (/)^^ {A)r\'io, 
noticing that this set is not empty. Then we deduce that 

sup d{y, ^) < e. 
j/ec 

Hence d^{'^,C) < e. 

Let us define the program P that contains the numbers e, p and K, 
and such that it specifies the coefficients {cj^a}, computes n and makes the 
computation of the 4)i{Vys. The binary length ^(P) satisfies 

^(P) ^ log [{e')-^) =\ogK + log(e-i) . 

Indeed, e is specified with 0(loge^^) bits, and p and K do not depend on 
£ and can be approximated by rational numbers. The coefficients {q^q,} are 
approximations of the {cj^a} with precision e', hence we can choose them 
as rational numbers requiring only 0(log(e')~^) bits of information. Finally 
the information for the computation of n and C needs only 0(1) bits of 
information. Hence this proves 

We now prove ([!]). Let X = {0,1} and define (j)i{x) = hx, 4>2{x) = 
1 — bx for b £ (0, 1/2). We denote by the Cantor set generated by the 
IFS {(pi,(p2}- To be in the context of Lemma 13.21 we take b at random 
according to the uniform distribution on the interval (1/4, 1/3). We restrict 
the possible values of b to ensure that the middle hole is large enough so 
that an obviously simplified version of Lemma 13.11 applies. 
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For a fixed 6 £ (0, 1), define := (1 — 5)k. For any k, there are at most 
2"'k different binary programs (Pj)i<j<2''fc of length — 1, which generate 
at most 2"'= different sets Cj := C(Pj). We define 

Vk,j:= {b:d^{%,C,) <2-''Y 

Then 

{6: A(^5,2-^) <afc} C U Vkj. 

i=i 

We now estimate F{Vkj). We denote by d^Cj the rightmost point of Cj H 
(0,1/2). For k large enough (/c > 3) and for a given j, if b £ V^j then 
d{h,d+Qj) < 2-^ and therefore P(Vfcj) < 2^-^. This implies that 

A; j=l k k 

The result follows from Lemma 13.21 

Remark 3.1. Notice that this proof works also in arbitrary finite dimension. 



3.2 Proof of Theorem [272] 

We give the proof in the case that the are analytic functions on an open 
ball B{0,R) C C of radius R> 1. The general case follows by applying the 
same argument to piecewise polynomial approximations of the {(/>«}. 
By hypothesis we can write for z £ A 

oo 

(piiz) = E ^i,h y i £l 

h=0 

with coefficients Ci^h G C. By the analyticity of the functions {(pi} in -6(0, R) 
it follows that 

limsup \ci^h\ R'^ < 1 ■ 

Hence there exists a real constant r such that 

\ci^h\ R^ <r \f h>0. 

As above let us denote by y = {0, 1} = d[0, 1]. We now construct approxi- 
mations of the analytic functions {(pi}- 

Let 6 G (0, 1) be such that 6R > 1, and, for e fixed, define 
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Let be the smallest integer satisfying 



h=N 

Hence we construct the polynomials 

N~l 



^ 1-6 4r 



(14) 



Ci,h z' 



h 



h=0 



with coefficients c,- /, G C such that 



V i G T V /i = 0, 



.A^- 1 



(15) 



and the (j)i^s are contractions functions on A with 0j(^) C A for all i £ Z 
and uniform contraction rate /5 G (0, 1). 

Let us choose n G N such that p" < ^ and < For this fixed n 
we define 

^■■= U 

We now prove that duC^jC) < e. The proof will follow again by using (I12[ 
and the analogue of pT]) . 

For any bounded set B C 5(0, (^i?) such that (j)i{B) C S and (f>i{B) C 
we have for all n G N 



dMjB),4>^jB))<e 



I- 



V G X" . 



(16) 



The proof of (jl6p is by induction as the proof of (jlip . We only show the 
first step. By using (fTSl) and (fTHl . we obtain for all z G B{0,6R) 



i{z) - 4)i{z)\ 



N~l oo 

h - Ci,h) + E '^'-'^ 



E (^^^ 



/i=0 



Af-1 



/i=0 



h=N 



<il-p)e\e 



(_JosR\ 



+ 



4r{6R-l) ' 4r 

where in the last equality we have used (fT3]) . Hence 



(I-P) 
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The inductive step is obtained as in the proof of (jlip by the triangle in- 
equahty for du. 

We now write (I12p and, by repeating the argument of the proof of The- 
orem [2?T] and by (fT6]) . we obtain (iH('^, C) < e. 

Let us define the program P which contains the numbers e, R, r, 5, N 
and n, and such that it specifies the coefficients {ci,h} for h = 0, . . . , N — 1, 
and makes the computation of the (piivys. The binary length l(P) satisfies 

£(P)^iVlog ^(log(e-^))' . 

Indeed, e is specified with 0(loge~^) bits, and n x loge^^. Hence it 
is specified by o(loge~^) bits and N x loge~^, hence it is specified by 
o(loge^^) bits. The coefficients {ci^h} are approximations of the {ci^h} with 
precision e' , hence we can choose them as rational numbers requiring only 
0(log(e')~^) bits of information, but there are of them for each function 
(see and (fT^ ). hence we need 0(A^log(e')~^) bits of information, 
that is 0((log(e~^))^). Finally R, r and 6 do not depend on e, hence the 
information for them and for the computation of C need only 0(1) bits of 
information. Hence 

A(<r,e)^(log(e-i))' 

and the theorem is proved. 

3.3 Proof of Theorem EH 

We first prove ^ by constructing an approximation C of the set 

Let us consider a fixed sequence A G (0, l)'^ and the Cantor set ^a- For 
a fixed e, let n be given by 

n := min InGN: — <->. (17) 
\ 2" 2 J ^ ' 

Next, let us consider a sequence A = {Xk)k<=N £ (0, 1)^ such that |Ai — Ai| < e 
and 

|Afe - Afcl < 2*^-2 e yk = 2,...,n. (18) 
Then we define the approximation C of 'rfx to be the finite set 

2" 



C := dCf = \J ajf (A) 



i=l 



where the sets and J^^ (A) are constructed as specified in Section El We 
now prove that dii(jif\,C) < e. To this aim we show that 

dH(9jf(A),9J^(A))<| (19) 
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for all /c = 1, . . . , n. The same argument applies to all other sets . This 
is enough since it implies that, for any two points x and y G C in the 

analogous intervals (i.e., x G ^^id V S 9 J" (A) with the same index 

i = l,...,2"). 



d{x,y) < d{x,dJf{\))+d^{dJf{\),dJf{\)) + d{dJf{\),y) 
< 



where for the first term we have used ^ and (|17|) . for the second term we 
have used ()19p . and for the third term we have used the definition of C. 
Hence dii{'iax,C) < e. It remains to prove ([T9|). By definition 



h=l ) 




Hence it is enough to show that 



I 1 1 ~ 

( ^ n ^ n 



2k 



< 



(20) 



for all = 1, 



h=l h=l / 

, n. By definition of A, it holds 



1 



Ir 



£ 



Assuming that (|2Up holds for k — 1, one gets 



< 



2k 



k-l 



h=l 




1 1 ~ \ 

n ^ n 

h=l h=l / 



/ ^ k-l ^ k-1 

+ ^ ( ^ n ^ n 



h=l 



2k- 



h=l 



Afc 

2 



< 



2k-i 



£ £ 
+ 4 ^ 2' 



where we have used pSD for the first term, the inductive hypothesis for the 
second term, and the fact that A, A G (0, 1)^. 

To finish the proof of © we have to estimate the length of a binary 
program P producing the finite set C. The program P must contain the 
number e, the information to compute n, the contraction factors (A^) for 
k = 1, . . . , n and the instruction to compute C. The number of instructions 
for all the computations are 0(1) with respect to £. The number £ is specified 
by 0(loge~^) bits of information and n ~ loge~^ and each coefficient A^ 
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needs 0(loge ^) bits of information. Since there are n coefficients to be 
specified, we find 

^ {log{e-')f 

hence ([6]) foUows. 

We now prove ([7]). First of all we identify a full measure set of "good" 
A. By the hypothesis on the density /(x) of the common distribution of the 
random variables (Xk), the following quantity is finite 



7 := / log(x) /(x) dx < 0. 
Jo 

Notice that ^ can be interpreted as the typical contraction rate, since prod- 
ucts of many i.i.d. random variables will be involved. 
Given any r] > we define, for all n € N, 

An:=|AG(0,lf :nA;,>e'^(^-''),VA;=L^/^J,...,n| . (21) 

We remark that, since A^ G (0, 1) for all > 1, if A € A„ then 

k 

n A/. > ev^(^-'') VA: = 1,...,L^/^J -1. (22) 

h=l 

Lemma 3.3. Let us denote by A'^ the complement of An in (0,1)^, then 

n 

i.e., almost every A G (0, 1)^ belongs to only for finitely many n G N. 

Proof. We use the large deviation principle for independent and identically 
distributed random variables (see, e.g., [8|). It implies that for any fixed 
rj > there exists a positive constant C such that 

lim - logP < ; 1 < —Vr = —C- ■ 

k-^oo k I k J 

Hence, for n large enough and for all < C' < C , we have the estimate 

n f k ^ n 

nK) < E IP ] n < ^"^^'"^ \ ^ e"""'^ E e"""'^'-^^ • 

k=Y^\ U=l J fc=LVnJ 

Therefore 

P(A^) ^ e-^'^ 

and the lemma follows by the Borel-Cantelli Lemma. □ 
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For any e we define 

iV(e) := min{n G N : (2e''~^)"e > (log(e-^))"^} . (23) 
We now consider a subset of A^(£-). Let g G N and define 

:= |a G A^(2-,) : (1 - Xk+i) ^'=^ > 22-^ V A; = 0, . . . , iV(2-'?)| . 
Lemma 3.4. We have 

i.e., almost every A G (0, 1)^ /ies in on/y for finitely many g G N. 

Proof. By Lemma [3. 3 1 and the Borel-Cantelli Lemma, it is enougli to prove 
that 

P(^'gnA^(2-,)) <oo. 

9 



First observe that if < A: < y/N{¥^ and 



(1 - Afc+i) > 2^-^ 2^ {e^-r)^/m^) 

then A satisfies 



(l-A.+i)S^^>2^-'. (24) 



Similarly, if y^N(2F^ <k< N{2-i) and 

(l-A,.+i)>22-'' 2^ (e''-^)\ 
then dMI) holds. Therefore 

P{A G A^(2-,) \ < 2^-9 2V^(2-^) eC'?-7)V^(2-^) + 



0(1) 



i2 



which is summable over q. The lemma is proved. □ 
For a given e, let A G ^iog(£-i) and define 

Me,N{e)W ■■= ^log(.-i) n 

|Ai-Ai|<2e 

A: \Xk - Xk\ < 2^+He^-^)'^ e ii k = 2, . . . ,[VN\ - I \ (25) 
|Afc-Afc| < ^(2e''-^)'e if k = [Vn\, . . . , N 
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We now show that if A M(A) then dnC^Aj'^x) — ^- This follows from 
Lemma |3. II and we now check the hypothesis to be satisfied. 

If A € ^iog(£-i) \ M(A) then one of the conditions in (f25]) is violated. 
Following the notation of Lemma |3. 11 we start with I = I' = [0, 1]. We take 
H =[^,1- ^] and H' = 1 - Then \H\ > 2e and \H'\ > 2e since 



A and A are in "if 



log{e-i)- 



If I Ai - All > 2e, then 



2 



2 



> e, and Lemma 



O applies with F = "Ta and F' = "rf,, implying dnC'^A, "^a) > ^• 



Assume that, for some k = 2, . . . , [v A^J — 1, all conditions in ()25p are 
satisfied up to /c — 1 and condition k is violated. Either there is an £ < A; 
such that 



1 1 ~ 

^ n " ^ n 



2^ 



h=i h=i 
in which case we define k to be the smallest such 



Or, if no such i exists. 



^ n ~ ^ IT 

h=l h=l 



> 



Ai. — Ai 



2k 



k-1 

■ n ~ ^ 



h=l 
k-1 



k-1 



k-1 



h=l 



h=l 



> 



h=l 



where we have used (I22p and the fact that the leftmost positive points up 
to the {k — l)-th step of the construction are e-close, and we set k = k. 

We win apply Lemma O with / = jf"^(A) and /' = Ji'\X). We take 



and 



H 



H' 



2k 



Ha, 

h=l 



k-1 



h=l 




k-1 



h=l 



log{e-i), 



and we can 



We have \H\ > 2e and \H'\ > 2e since A and A are in ^ 
apply Lemma [3TT] which gives du(jif\,'if)J > £■ 

The same argument applies if the k-th condition with k = y/N, . . . ,N is 
violated, and all conditions up to /c — 1 are satisfied. If the leftmost positive 
points up to the (k — l)-th step of the construction are e-close, we get 



^ n - ^ n ^ 



h=l 



2k 



h=l 



>2e {e^~"ff-^l[Xh-e>s 



k-1 



h=l 



by definition (i2T]l of A7v(£) and using, as above, that all previous leftmost pos- 
itive points are e-close to each other. Again this implies that dnl'^A) '^x) > £• 
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Let us now estimate the measure of the set M£jv{e)('^)- By an easy 
computation based on the independence of the random variables (Afc), we 



obtain that for ah A G ^iog(e-i) 



< 



(2 e''-^)"^+°^^^^^^ = e-0(i)(iog(e-i))2^ (26) 

where we have used the definition ()23p of N[e). Note that this estimate is 
uniform in A E ^iog(e-i)- 

For a fixed 5 S (0,1), define Og := q^~^ . For any g, there are at most 
2"« different binary programs (Pj)i<j<2"9 of length — 1, which generate 
at most l"-" different sets Qj := We define 

Vq,, :={A:dH(<^A,C,)<2-''}. 

Then 

{A: A(<^A,2-^)<a,} C J ^gj- 

i=i 

We can write 

U y,,,- <P(A^^(2-,)) + P(^'^nA^(2-,)) + P(n,.n^,). 

Moreover if Vqj r\^q / 0, there is a A G such that Vqj r\^q C 
M2-9 7v(2-9)(^)- By Lemmas 13.31 and 13.4^ and by (p6]) it follows that 

EE w,.)<oo. 

The result follows from Lemma [37 
3.4 Proof of Theorem EH 

Preliminaries. We first recall the definition of the scaling function S^^ of 
a Cantor set 'rf ([Il],[l3]). In the sequel we fix T = {0, 1}. For a word 
w j' G we let 

Ju." := <A„n ([0, 1]) = cP^„ o cP^^_^ o . . . o </,^^([0, 1]) . 
Then by definition (pi{Ju)^) = J^j^i for any i £ I, and it holds 
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The scaling function describes the contraction rates in the previous inclu- 
sions. For a word G T'^ we define S^giuf-^) G (0, 1)^. The two components 
of 5'<g'(u;") are the rates of contractions 

where | J| denotes the length of the interval J. The length of the gap between 
the two intervals Joojj^ and J\^^ in J^^n can be reconstructed from these data. 
The scaling function is defined to be the function 



S^g ■ ^ (0, 1)2 

given by 

S^giuj) := lim S<:g{u:i). 

n— »oo 

We refer to [H] for the proof of the existence of this limit. 
By definition one has 

n-l 

l^rl = n(^''^K+i)) • 

By using the scaling function we can introduce a distance ds{uj,u}) on X°° 
in the following way. For two sequences uj,u! € let uj Dui denote their 
longest common prefix, and let \uj D u)| denote its length. Then we let 

n=\uir\Li>\ 

dsiu:,^) := sup Yl q))^, . (27) 

Then there exists a constant H > such that for any uj,u) £ X°° it holds 

Relations between the properties of the scaling function of a central 
Cantor set and the differentiability of the IFS generating this Cantor set 
have been studied in [T3] in the case k > 1 (we refer the reader to Main 
Theorem [13], page 406). The idea is the following. Let A{oji) denote 
the set of the four boundary points of the intervals {Jiuj^)iex- A scaling 
function S^g generates a Cantor set 'to if and only if for any n G N there 
are diffeomorphisms from A{uji) into ^(u)"), for any 7^ cDi G X", with 
derivatives bounded by a constant C(u;",a;") which satisfies 

C(a;^ cD^*) = C dsiuj'^a, (Jj-^af-^ (28) 

where C does not depend on n and from the definition of ds the right hand 
side is independent on a G X°°. 
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Proof of Theorem 12.41 We first prove ([8]). Let e be fixed. We sliow 
how to approximate the set ^ within Hausdorff distance e. We will give the 
proof for integer A: > 1. The proof easily extends to functions whose k-th. 
derivative is Holder. 

We can write the Taylor expansions of the maps at a point xq 

k-l 

(pi{x) = ^ Ci,p(xo) {x - xqY + Ri{x, xo) V X G [0, 1]. 

p=0 

Moreover there exists a constant K > such that \Ri{x,xo)\ < K\x — xq]'' 
for X S [0, 1], for all i £ I and xq G [0, 1]. 

Let e' = e for a constant M to be specified later on. We now 
construct a sequence of polynomials which approximate the maps ((/>j)jgi. 
If D is the box counting dimension of the Cantor set we need for any 
(5 > at most N = 0{{e')^^~^) intervals {Is)s=i,...,N of size (e')* to cover 

Hence we can consider the maps restricted to the sets (I^). If Us 

denotes the middle point of the interval I^, let ys be the approximation of 
the point ys within a distance e' . Then we define 

fc 

= - VsY V X G [0, 1] 

such that 

\ci,p{ys) - Ci^p{ys)\ < e' M i £l M p = . . . ,k (29) 

and they are contractions on M with the same uniform contraction rate 
~p<l. 

To construct an approximation of we work on the boundary points of 
the intervals J^^ which all are in ^. Let us denote J^^ = [y^n , y^n] . Since for 
any n G N and any G T" we have y^n G ^ for r] = 1,2, we can associate 
to a given a sequence Co""*^ G {!)••• ,-/V}"~^ which specifies to which 
intervals of the cover {Is)s=i,...,N the pre-images y''„_i,y''„_2, • ■ ■ ,y2>i,y2 of 
belong, where yj' G {0, 1}. 

We now establish the analogue of (llip for the boundary points. Let us 
define 

then for all n G N it holds 

< |^/c + l + i^+ .^^max ^^j; Va;^ GX'^. 

(30) 
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The proof is by induction. The first step (n = 1) follows by definition of the 
approximating polynomials, estimates ()29p and properties of the remainder 
Ri{x,xo). This yields 

k k 

< e'Y^ie'f^+Y. hAy<romy]-y<ror-{y]-y.or\ + Ke' 

p=0 p=0 



l + K + max V \ciAys)\ e' . 

iG2, s=l,...,N — ' / 



< u 

iPT. .9=1 N 

p=0 



The inductive step follows by using the triangle inequality 
together with 



\(l^u.Ay\-i) - CrHy\-i)l < e' U + l + K + max V \ci,p{ys 

^1 ^1 \ t£l, s=l,...,N ^ — ^ 

and 



p=0 



where p is the uniform contraction rate of the approximating polynomials. 
Let us choose n such that < | for all n > n. Then we define the set 

c:= U {yk^yk) 

and we claim that dii{'^, C) < e. Indeed, by definition of n, any point in the 
Cantor set "j^" is at most at distance | from a point in the boundary of one 
of the sets J^^n. Moreover, by construction of the points y^n we have (pOl) . 
hence the claim follows since 

dH('^,C)<sup inf Ix-y^'^l 
< sup inf (\x -ylu\ + lylu - yln\) < e 



provided that we choose 
/ 

M:=2 A; + l + i^+ max ^ \ciJys 
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Let us define the program P that contains the numbers e, p, D, M, K 
and A;, and such that it specifies all the necessary coefficients Q^p, makes the 
computation to obtain N and the approximated points y<j, and moreover it 
makes the computations to obtain n and the points y^^ . The binary length 
^(P) satisfies 

since e is specified with 0(log(e~^)) bits; p, D, M, K and k do not depend 
on £ and can be approximated by rational numbers. The coefficients Cj^p 
and the points ys are constructed as in the previous proofs with precision e', 
hence each of them needs 0(log(e~^)) bits of information and their number 
is Nik + 2) = 0{e~~~^). Finally all the computations to obtain C need 
bits of instructions. Hence ^ follows. 
We now prove We define a class of particular scaling functions 

S{a) on X°° to construct differentiable Cantor sets with the given distortion 
complexity. 

Let us denote by X* := U„£n ^T" the countable set of finite strings s 
written using the alphabet X. Let (As)sgj* be a family of independent 
identically distributed random variables with values in the interval (0, 1) 
and absolutely continuous distribution with density /(x) bounded above 
and below away from zero. Note that the empty string [j belongs to I* and 
therefore there is an associated random variable Ajj. 

Let 0<C<l)0<p<l and p < 6* < 1 be given constants, with p 
determining the contraction rate. Q will be chosen small enough later on. 
We will only consider central Cantor sets, namely the two components of 
the scaling functions will be equal. We define the scaling function 

oo 

Sx{a) := p + ^"'^ K\ VaGT°° 

which depends on the realisation of the family (A^). We remark that for any 
realisation it holds 

P<Sx{a)<p+-^^ ya€l'^. (31) 

Hence if C is small enough, the rate of contraction is almost p. It is also 
useful to define the truncated scaling function Sx by 

m 

Sx{u^T)--=P + CT. ^^t Vm>0. (32) 

q=l 

Using the relations 

^^ = 5A(^n ViGT (33) 
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we can construct a central Cantor set generated by the scaling function 
S\. From (j3ip it follows that the Cantor set ^\ has box counting dimension 
which satisfies 

^(C) = - {^ + 0(0 as (34) 
logp 

We now consider the difi'erentiability of the IFS generating '^\. By (i3T]) 
and the definition ([77]) of dg it follows 



n = w n a) 



for any LiJ,a; S X°°. Moreover for any 7^ u) it holds 

g|n+l 

|5A(w)-5A(i')| <2C n=|c^nLD|. 

Hence for m > n we have diffeomorphisms from ^(u;™) into ^(tDj") with 
derivatives bounded by a constant C(u;™,(Z'^) = 0{6"'~^^). These facts to- 
gether with relation ([28]) imply that the Cantor set "^a is of class with 

^ = 1 + |^^>1. (35) 
logp 

Let < e < 1 be fixed and small enough depending on the constants 
p,6,C- Let A := (A^) and A' := (A^) denote two different realisations of 
the family of random variables. We give a condition on A and A' to have 
ciH(^A,^A') > £• We denote 



P 



log(Ce C"') 



log{p9) 



(36) 



where C is a positive constant (independent of e) to be specified later on. 

For any a £ I* , we denote by and the intervals associated to a in 
"^A and ^A' respectively. 

Lemma 3.5. Assume there is < p < p satisfying 

max dn{J^p,J'^p) > e. (37) 

Then d^{'^x,'^\') > e. 

Proof. Denote by p the smallest integer for which the above inequality holds 
and by G I'p the string realising the maximum. We apply Lemma 13.11 
with / = J^p-i and /' = J' p-i- The hypotheses on I and /' follow by the 

fact that (f371) is violated up to p — 1. The gaps H and H' have size at least 

pP-^ > {pOy = Ce C'^ > 2e 
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for e small enough if ^ > 2. 

Finally since dH(J^p, J'^p) > e we have all the hypotheses of Lemma [3. 11 
Hence the lemma follows. □ 

Lemma 3.6. Assume that there exists < p < p and a sequence uj^ G 
such that 

Then dn{'rfx,'^\') > e. 

Proof. Denote by p the smallest integer for which the above inequality 
holds. It is enough to assume that for any q < p condition (f3711 is not 
verified, otherwise the proof follows immediately from Lemma |3.5[ 

Let X (respectively x') be the point in the boundary of J^p (respectively 
J'^p) which is not in the boundary of J^p (respectively J'^p)- Let y (respec- 
tively y') be the other boundary point of J^p (respectively J'p). Since by 
the recursive assumption \y — y'\ < e, we have 



\x — x'\ > \x — y — {x' — y')\ — s. 

Now since |x — 2/| = \J^p\ and (x — y) and {x' — y') have the same sign, we 
get 

\x-x'\ > ||J^p| - |J;;,p|| -e 
This can also be written 

\x-x'\ > \Sx{ujI)\J^p \ - Sx'{ujI)\JIp\ \ -e> 



> Pa(^2) - '^A'(^^2)| 2 2 '^2' ~ ' "^2' 



e > 



since dn ( Jui'' ^ J' p) ^ ^ ^^'^ ^ P- From. ([32]) we get 



9 
2 

g=2 



and from (|33|) we have 



q 

"2 '^2 

q=2 



2 I"/..?-! I 1"^' p-1 I 

^2 1^5 

_i 1 — 2 
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Since |J,p-i| < iJ^p-A < {p+j^Y ^ we get (using again \^Juj^,J'^pj < 
and S\ > p) 



q=2 



<2 p 



-p+2 



e . 



Hence 



We conclude that 

\x-x'\>C{per \X^P-Xlp\-{3 + 2p')e. 

Therefore the lemma follows by applying Lemma □ 
We now want to estimate for a given realisation A of the family (A^) the 
probability of the event 



(S'p{\) = sup {pOy sup |A(j — A'o-I < 

By independence of the family (A^) we get 

p 



(4 + 2p^) e 
C 



where 



'(4(A)) <n(^ ip^^y^c) 

q=0 

C={4 + 2p2) sup fix) 



^2" 



where / is the density of the random variables (As 
These relations imply that 



logP(4(A)) < 2'?(log(Cer')-glog(/oe)) 
< 2P+Mog(p0)-(3+p)log(p0). 

For a fixed 6 G (0, 1), define ai := 2^'^~^~^~k^ and choose e = 2~^. For any i, 
there are at most 2'** different binary programs (Pj)i<j<2''^ of length — 1, 
which generate at most 2"^ different sets Cj := C(Pj). We define 



Vt,j:=[\:d^{^x,C,)<2-'Y 



Then 



{A: A(^A,2-0 <a£} C J 

i=i 
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If Vij is not empty, there exists A such that Vij C <%(A). Then 



< 0(1) exp (2^("'^+t) log 2 + 2P+1 log(p0) - (3 + p) log(p0) 



Using ([3l]) . ([35]) and ([SG]) . it follows that for any S > one can find C > 
small enough such that this is summable in i. Hence we can apply Lemma 
to complete the proof. □ 
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